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Abstract: In the presence of loss and gain, the coupled mode equation 

on describing the mode hybridization of various waveguides or cavities, 
or cavities coupled to waveguides becomes intrinsically non-Hermitian. 
In such non-Hermitian waveguides, the standard coupled mode theory 
fails. We generalize the coupled mode theory with a properly defined 
inner product based on reaction conservation. We apply our theory to the 
non-Hermitian parity-time symmetric waveguides, and obtain excellent 
agreement with results obtained by finite element fullwave simulations. The 
theory presented here is typically formulated in space to study coupling 
between waveguides, which can be transformed into time domain by 
proper reformulation to study coupling between non-Hermitian resonators. 
Our theory has the strength of studying non-Hermitian optical systems 
with inclusion of the full vector fields, thus is useful to study and design 
non-Hermitian devices that support asymmetric and even nonreciprocal 
light propagations. 
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1. Introduction 

Coupled mode theory (CMT) can be traced back to 3 decades ago HI, and has been an in¬ 
dispensable tool of analyzing and designing waveguides, resonators, couplers or many other 
devices from microwave frequency |I30 to optical frequency IfSl tTTI , in both time and space 
domain. CMT has its mathematical root of variational principle lfT4ll . which yields a stationary 
value for target physical quantities of the coupled optical systems, such as propagation constant, 
eigen-frequencies, or impedance and so on. CMT was invented to study parametric amplifiers, 
oscillators, and frequency converters in microwaves 13, with a rigorous derivation given by 
Schelkunoff using mode expansion IT], and Haus using a variational principle Q. The CMT 
for optical waveguides was developed by many authors EM, and further refined by Hardy et 
al., using non-orthogonal coupled mode formulation (NCMT) BH, in contrast to the orthog¬ 
onal coupled mode theory (OCMT). Subsequently, temporal coupled mode theory was applied 
to study coupling among optical resonators, or resonators coupled to waveguides lHIHIl. 

It can be proved that CMT is exactly equivalent to Maxwell’s equations, as long as a com¬ 
plete set of modes is taken in the mode expansion in constructing the coupled mode equations. 
In most cases, we only need a few modes (usually two) in the mode expansion, because all 



the other modes, for instance, the continuum modes in waveguides, do not couple to the tar¬ 
get mode that we are interested in. Even though CMT remains approximate in the truncated 
mode set, yet insightful and often accurate description of target quantities of a coupled sys¬ 
tem can be obtained. By excluding those irrelevant modes, CMT for a two-modes structure 
can be reduced into a form of 2x2 matrix, which is a classic model and has been extensively 
used in describing many different coupled physical systems, such as two coupled mechanical 
oscillators, light (single-mode ) matter (single atom) interaction in Jaynes-Cummings model, 
two coupled resonators in electric circuits and so on. Mostly, for a lossless system, the energy 
flow from one mode/oscillator/atom to another mode/oscillator/photon is reciprocal, namely, 
the energy will flow back in a ’reciprocal’ way as ’time’ is reversed, which renders the coupled 
system Hermitian, or the transition matrix of the coupled system as a Hermitian Matrix. 

The aforementioned formulations of CMT rely on a definite and conserved optical power of 
the whole coupled system for either OCMT or NCMT, which essentially by default selects a 
scheme of complex inner product. However, for coupled optical parity-time (,^,;?')-symmetric 
structures lfT5l425l the total integrated power is not a conserved quantity, especially in the bro¬ 
ken phase after the exceptional point. Therefore, the standard CMT fails in such non-Hermitian 
systems (na. In iia, the authors formulate a CMT to studying J^^iT-symmetric structures 
through a Lagrangian treatment, in which the vector optical held is approximate as a scalar held 
envelope. In this work, we provide a conceptually simple model to construct general coupled 
mode theory (GCMT) using reaction conservation, a well-known concept in electromagnetism. 
The GCMT is capable of capturing the vectorial nature of the optical fields. Particularly, us¬ 
ing a scalar inner product, we prove that the Maxwell equation remains self-adjoint, even in the 
presence of loss and gain. We further construct GCMT based on perturbation. In the application 
of GCMT, we study i3^,^^-symmetric waveguides with balanced losses and gain. The predic¬ 
tion of our theory shows excellent agreement with results obtained by the fullwave simulation 
(COMSOL). 

The paper is organized as follows. In Section 2, we give the foundation of our general coupled 
theory based on reaction concept. Secondly, we discuss the procedures of constructing GCMT. 
In Section 3, we study the mode dispersion of J^^iT-symmetric waveguides using GCMT, and 
compare it with conventional coupled mode theory. Finally, Section 4 concludes the paper. 


2. General couple mode theory 

2.1. Reaction concept and self-adjointness 

The reaction is a physical observable introduced by Rumsey ll26l to measure the reaction be¬ 
tween sources Sa and St,, defined as follows. 


{Fb,Sa) = {Fh,aSa) 


( 1 ) 


where Fj, is the held generated by source Si,. As seen from Eq. O, the inner-product (-I-) is 
defined for the vector held, and equals (•O’-), where (0(r)|^/'(r)) = / dr(^'^(r)\^{r) for two 
column vector fields 0 (r), V^(r). It is important to point out that there is no complex conjugate 
operation over the fields, therefore the relation (^(r)|l/r(r)) = (t/r(r)|0(r)) holds. Denoting 
the held F = [E,HY , and S = [J,MY , J (M) electric (magnetic) current density. The reaction 
given in Eq. ([T]) can be explicitly given by 


(Fi,5„) = Jdr[ Eb Hb 


Ja 

M„ 


( 2 ) 


where the metric tensor a = 


i 0 

0 -I 


1 denotes the identity matrix, 0 the zero matrix. 


The link between the source S and the associated vector fields F is given by the Maxwell’s 




equations, 


(3) 


where L = 


Vx 

ikoBf 


ikotlr 

Vx 


LF = WS 

0 -i 

i 0 


\,w = 

Based on the reaction concept, the reciprocity theorem can be given 


(4) 


which states that the response of one source to the external field induced by another source 
equals to the response of the second source to the field given by the first source. Equation (IHi 
imposes certain constraints on the material parameters as given by Er{r) = (r) and /i^(r) = 

Such medium is called reciprocal medium in the literature, and could be lossy or active, 
as long as reciprocal conditions are fulfilled. We can reformulate Eq. Q in a matrix form as 
follows, 

HF = 5, (5) 

where H = W ^L. Eor any reciprocal medium, we find that for any two vector fields Xjf, 0, the 
following relation holds; 

= ( 6 ) 

Equation (|6]l means that the operator H is self-adjoint in the scheme of the inner-product given 
by Eq. ([T]i, which has relevant consequences and implications that we intend to discuss. Eirstly, 
the self-adjointness of the operator H and the reaction for defining reciprocity share the same 
definition on the inner-product. Moreover, the self-adjointness of H is equivalent to reciprocity 
theorem. In other words, the reciprocal conditions on the material parameters are necessary 
and sufficient condition both to self-adjointness of H, and to reciprocity theorem. Secondly, the 
self-adjointness of the Hamiltonian H guarantees that the underlying space associated with the 
operator .& is complete, despite the existence of losses or gain in the material parameters. Lastly, 
the left eigenvector space (bra space (-1) and the right eigenvector space (ket space 1)) are 
closely related with each other for self-adjoint operator IZTl . namely, one determines the other, 
and vice versa. Eor complex inner-product, the self-adjoint operator is conventionally called 
as Hermitian operator. The matrix form of right and left eigenvectors are complex conjugate 
and transpose to each other, i.e., j-) = [((-1)^]*. As for a scalar inner-product, the matrix form 
of self-adjoint operator is symmetric, hence the left and right eigenvectors has the relation of 


2.2. Dimension reduction: a non self-adjoint formulation for waveguide problems based on 
variational principles 

It is important to realize that the self-adjointness of the Maxwell’s equations in reciprocal 
medium, as given in Eq. is valid for the inner-product defined in 3D space. As for the 
development of GCMT or the modal solver based on variational principle for waveguide prob¬ 
lems, it is necessary to reduce the 3D formulation into its 2D counterpart. 

Considering an infinitely long waveguide, the waveguide modes of the original problem 
are given by 0 = [e(r),/i(r)]^, where e{r) = e(x,y, h{r) = 

The adjoint fields are given hy Iff = [e“(r),h‘‘{r)]'^, where e“{r) = e“{x,y)e‘^“’^P^\ h“{r) = 
We choose counter-propagating modes with the same j3 as our mode sets 
0 = and respectively. The relation between the 

fields propagating in -\-z and —z direction is given by = {ex,ey,e^}, — {hx,hy,h^}, 

e = {ex,ey,—e^}, h = {—hx,—hy,hz}- The particular choice of the mode sets 112814311 is 
relevant: (1) it is necessary to get a functional of coupled modes that is independent of z, mean¬ 
ing the terms in the inner-product between the modes sets tp and Xjf are canceled; (2) the 


mode pair of counter-propagating modes has a definite relation, hence <j> can be deduced from 
ijf, and vice versa. It is easy to get the z-independent and source-free wave-equation for the 
mode profiles ^ 2 d = of the original problem as follows 


H2d^2d = 0 


(7) 


where H 2 d = 


Mr" 


V, X -ijizx ikoH^. 
—ikoEr V, X —ijizx 


and s“ = 


ef- 


and ju" 


, and g'' (/rf) denotes the 2x2 in-plane components of electric (magnetic) di¬ 
electric function, V, =x^ From Eq. Q, and the predefined modes of ^ and one can 

also obtain the adjoint system from the original ones {H 2 d), as given by 


where ^ 2 ^ = [^ = 


^IdWld — 0 

V, X +il5zx ikop." 

—ikoS" Vr X +ifizx 


£ = 


-£'■ 


-Er 


(8) 

and 


Pr 


Mr 
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. It is easy to find that the following relation holds 


iV2d^^2d<l>2d) — {f^2dV2dT^2d)j (9) 

where the inner-product is carried out over 2D computational domain, i.e., transverse plane of 
the waveguides. As regards to the comparison between 2D formula (Eq. ®) and 3D formula 
(Eq. (|6]l), a few remarks may deserve attentions. Eirstly, the operator H 2 d with inner product 
defined over 2D domain is not self-adjoint Il30l432l . e.g., H'^ ^ H 2 d, for reciprocal medium. 
Secondly, for self-adjoint system, the adjoint system and the original system can be treated 
separately. As for a non self-adjoint electromagnetic problem ll28l . one need to solve original 
problem, as well as its adjoint problem simultaneously to provide a complete but biorthogonal 
mode sets to construct the coupled mode equations, or any other modal solver based on vari¬ 
ational principles, e.g., method of moments (MoM) and finite element method (EEM). As for 
Eq. (|9]l, it is necessary to obtain a unified variational form that contains contribution from both 
H 2 d and H 2 d- The explicit unified variational form for eigen-mode problem of waveguides is 
the following, 

Y = {5xif2d,H2d(l>2d) + iHld¥2d,S<l>2d)=0. (10) 

which is the first variation, e.g., dip 2 d and to the functional I = {'^2d^^2d^2d)- Accord¬ 

ing to variational principle, Eq. (flOl i indicates simultaneously the optimal solution of and 
(l> 2 d to Eq. (I7]i and Eq. ®, as long as the functional / is stationary for any \lf 2 d and <j> 2 d- Thirdly, 
the mode set (\lf 2 d) of the adjoint system is selected as the countering propagating modes of the 
original system {^ 2 d^- As such, the total degree of freedom of unknows is halved OOl . We note 
that GCMT can also be applied to bianisotropic waveguides m- 

2.3. Procedures of constructing GCMT for waveguide problem based on perturbation 

Given a perturbation to the adjoint system by AH, i.e., = ^id + AH, we shall have 

the relation according to linear response of Maxwell’s equations, {\p 2 diH 2 d^ 2 d) — ~ 

AH]v^2£ti02rf)’ which can be reformulated as 

i¥2d^^2d4'2d) — i^2d¥2dT^2d) = { — ^¥2d^^2d)- 


( 11 ) 


Perturbation implies that AH is small, hence can be approximately taken as 0, 

which leads to 

~ (ftldWldi^ld) = 0- (12) 

Equation (fTSl i gives the connection between the original system and the perturbed adjoint sys¬ 
tem via the reaction conversation under a small perturbation AH, and can be transcribed into a 
set of coupled mode equations, which is essentially the GCMT proposed in this paper. Firstly, 
we use normalized fields e{r) = h{r) = propagating in the +z direc¬ 

tion, satisfying Maxwell equations 

Vt X - ipojZ X (13a) 

V, X hQj - iPodZ X h^- = ikoE^e^j, (13b) 

where the subscripts 0 and i stand for no perturbation case and mode labels, respectively. 

We consider there is a small perturbation of e. The idea is that the fields under perturbed sys¬ 
tem can be approximated by a linear combination of the unperturbated fields of the adjoint sys¬ 
tems. Therefore, the fields of perturbed system could be written as e'{r) = (Ey-ay-eQ 
h'{r) = (LjajhQ and satisfy 

V, X 'i^jajeQ j -I- iPz x T.jajeQ j = -ikofi,.I.jajhQ j, (14a) 

V, X T.jajhQ j -f ijSz X ^jajhQ j = ikoEr'LjajeQ j, (14b) 

In equivalence with Eq. (fT2l i. we derive GCMT from perturbation as follows, 

JJ {Eq. (fT3b ) ■I.jajhQ j — Eq. (flTb ) -e^j+Eq. (fT3l ?) ■I.jajeQ j — Eq. ^\Mi] ■ .Adxdy (15) 

In case that a small perturbation is present in the imaginary part of Er, i.e. Er = E^y+ iAe{x,y), 
the formula resulted from Eq. (fTSl) can be simplified as follows, 

-KP- Po.dPij] =0 (16) 

where fi,y = ff{V, ■ (h+ x e^j) - V, • (hoj x e^^pjdxdy, pij = ff{z ■ (e^j x h+) - z ■ (ej,- x 
j)}dxdy, kij = iko JJ Ae{x,y)ejj ■ e^jdxdy. For Ae(x,y) = 0, we shall have the following 
relation 

b,j = i{poj-po.i)ptj- (17) 

Inserting bij back into Eq. (flbl l yields 

'LjOjOi — Pq j)pij = Hjajkij. ( 18 ) 

Equation (fTSl l is the matrix form of GCMT proposed in this paper, which will be used to study 
Hermitian and non-Hermitian waveguide with some concrete examples in the following section. 

It is worthy to write down the CMT derived from conventional CMT (CCMT) fS] for com¬ 
parison. To this end, Eq. (fT^ shall be reformulated as 

iWldMid) - (iH*¥2dr,<t>2d) = 0, (19) 

where * indicates the operation of complex conjugation, and the metric tensor is modified as 

f I 0 \ 

a = ( Q j I accordingly. Following the same procedure, we have 


EyOy-j^ P0.j)Pij — 


(20) 
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Fig. 1. Real part and imaginary part of effective mode indices («e//) versus Ae using GCMT 
(a,b) and CCMT (c,d). Gray solid lines are calculated from fullwave simulation. Inset shows 
the schematic diagram of two coupled core layers with loss and gain, sun'ounded by air. 
Dimensions are h = 0.2Ao, w = 0.3Ao, d = O.OSAg (blue solid circles, green open circles) 
or O.O 5 A 0 (red diamonds, magenta crosses). Eg = 10. Aq is vacuum wavelength. 


where bij = ff{V, ■ (/i+. x e^j) + V, • {h*o j x e+j)}dxdy, pij = ff{z ■ (e^j x + z ■ (e+. x 
hQ j)}dxdy, kij = ikg JJ Ae{x,y)eQ j ■ e^-dxdy. In this case, for Ae{x,y) = 0 , bij + = 

‘ {pQ.j - Pij where fij = -ikg + filo)h* ■ hf + (Cr.o + e^o ' 4}dxdy. We will 

show in the next section that, CCMT works fine for Hermitian waveguides, but not for the case 
where non-Hermitian waveguides are considered. 

3. Results and Discussions 

In the following, we use Eq. (fTsT i to analyze dispersion relations in J^ciT-symmetric waveg¬ 
uides as discussed in MM- The structure of ,^,(r-symmetric waveguides is shown by the 
inset of Fig. [T] It is composed of two waveguides with identical geometry dimensions placed 
close to each other. It is well known that the pair of an even and odd super mode is formed in 
this case. Phase transitions can be observed as the magnitude of the imaginary part of Er of two 
waveguides, i.e., Er = £,-.0 + in core layer 1 and £r = Erfl — iAe in core layer 2, crosses a crit¬ 
ical value as shown by the inset. This is used to create a symmetric index guiding profile and an 
anti-symmetric gain-loss profile. The gain/loss perturbation creates coupling between the odd 
and even mode pair so that the effective index of the two supermodes becomes closer and closer 
until an exceptional point where they become identical. Beyond the exceptional point, the real 
part of rieff remains the same, but the imaginary part of of two modes break into two 
branches. When two waveguides are put more closer, the modes of two waveguides coupled 
more intensely so the even and odd supermodes have larger separation in the effective mode 
index. Therefore, it needs larger gain/loss parameter to get to the exceptional point. These are 
confirmed by COMSOL where two different gap size between two core layers are considered, 
as shown by the gray lines of Fig. [T] 

Then we apply our theory to predict the dispersion curves of the above mentioned structures. 
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Fig. 2. Real part of n^ff versus Ae. Gray solid lines are calculated from fullwave sim¬ 
ulations. Blue open circles (red crosses) represent results derived by GCMT(CCMT). 
d = O.O 3 A 0 . Other parameters are the same as Fig. [T] 


In this case, Eq. (fTSl l is an eigenvalue problem with the following form 

Po.iPii-ikii PoaPn —ikn 
_ Po.iP2i - ihi Po,2P22-ik22 

where pij and kij are dehned in previous section. As a starting point, we use the mode helds 
provided by COMSOL at Ae = 0. Propagation constants as well as eigenvectors are updated 
according to Eq. dlTTi using mode helds at this point. Next, in-plane helds are recalculated 
using updated eigenvectors and used for deriving propagation constants as well as eigenvectors 
in the next step. By choosing a small step, full dispersion relations as a function of Ae can 
be resolved. The solid symbols in Eig. [Ha) and[TJb) show effective mode indices given by 
tigff = p /k() derived by our method, where they match results from COMSOL remarkably 
well. Eor comparison, we also show the results obtained by CCMT in Eig.lTJc) andlTJd). In this 
case Eq. (l 2 T]) becomes 


ai 

= P 

Pil 

Pn 


a\ 

0-2 


P2l 

P 22 


0-2 


■ j3o ipii - ikn jSo 2P12 - ikn 
j3o 1 P 2 I — *^21 i3o,2P22 — ik22 

It is clear that in this case CCMT fails to capture the major feature of ,?3^,(^-symmetric waveg¬ 
uides. However, instead of perturbing in the imaginary part, CCMT works hne for the case 
when Er is present the real part. Red crosses in Eig. |2|a) shows the case that Ae is real and 
increases with identical sign in both core layers. In this case, the separation between two mode 
indices remain the same but their absolute values increases as Ae increases. Red crosses in Eig. 
|2lb) shows the case that Ae is real but increases with opposite sign in two core layers. In this 
case, two mode indices are further separated as Ae increases, indicating anti-crossing features. 
In both hgures, only real part of tigff is shown since imaginary part of iigff in all cases is zero. 
Dispersion relations calculated according to GCMT are also shown in Eig. |2]with blue open 
circles. Clearly, GCMT developed in this work gives the same results as CCMT does, agreeing 
well with full wave simulations given by gray lines shown in Eig. |2] 
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4. Conclusions 


From reaction conservation, we provide a solid foundation for the construction of general cou¬ 
ple mode theory that can handle mode hybridization in non-Hermitian waveguides. Using a 
scalar inner product, we establish the equivalence between the self-adjointness of Maxwell’s 
equations and reaction conservation. As for waveguide problems, the dimension of the self- 
adjoint relation need to be reduced from 3D to 2D, in which the formula turns out be non self- 
adjoint problem. Using coutering-propagating modes as the dual space of 2D non self-adjoint 
waveguide problem, the eigenmodes can be resolved from variational principles. Importantly, 
the 2D non self-adjoint relation can be elaborated into a set of coupled mode equation. We give 
a detailed discussion of the dimensional reduction for waveguide problem that relies on vari¬ 
ational principle. We then provide a procedure of constructing GCMT for waveguide problem 
based on perturbation, which yields a set a coupled mode equations. To illustrate the effec¬ 
tiveness of GCMT developed in this work, it is applied to study the phase transition of coupled 
,^,iF-symmetric structures and shows excellent agreement with fullwave simulations. For com¬ 
parison, results derived from CCMT are also shown, which fails to capture the major features 
of waveguides. Our theory provide direct analysis of eigenvalues of ,^,ir-symmetric 
structures with in-cooperation of full vector helds. Thus it might be useful to study and design 
non-Hermitian devices that support asymmetric and even nonreciprocal light propagation. 
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